skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Szpiech, Zachary A"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. The Kirtland's warbler (Setophaga kirtlandi) is a rare migratory passerine species and habitat specialist of the North American Jack Pine Forests. Their near extinction in the 1970s classified them as endangered and protected under the Endangered Species Act of 1973. After decades of intense conservation management, their population size recovered, and they were delisted from federal protection in 2019. We explore the genomic consequences of this harsh bottleneck and recovery by comparing the genomic architecture of two closely related species whose population sizes have remained large and stable, Hooded Warblers (Setophaga citrina) and American Redstarts (Setophaga ruticilla). We used whole‐genome sequencing to characterize the distribution of runs of homozygosity and deleterious genetic variation. We find evidence that Kirtland's warblers exhibit genetic patterns consistent with recent inbreeding. Our results also show that Kirtland's warblers carry an excess proportion of deleterious variation, which could complicate management for this conservation‐reliant species. This analysis provides a genetically informed perspective that should be thoroughly considered when delisting other species from federal protections. Through the increasing accessibility of genome sequencing technology, it will be more feasible to monitor the genetic landscape of recovering populations to ensure their long‐term survival independent of conservation intervention. 
    more » « less
  2. Bachman’s warbler (Vermivora bachmanii)—last sighted in 1988—is one of the only North American passerines to recently go extinct. Given extensive ongoing hybridization of its two extant congeners—the bluewinged warbler (V. cyanoptera) and golden-winged warbler (V. chrysoptera)—and shared patterns of plumage variation between Bachman’s warbler and hybrids between those extant species, it has been suggested that Bachman’s warbler might have also had a component of hybrid ancestry. Here, we use historic DNA (hDNA) and whole genomes of Bachman’s warblers collected at the turn of the 20th century to address this. We combine these data with the two extant Vermivora species to examine patterns of population differentiation, inbreeding, and gene flow. In contrast to the admixture hypothesis, the genomic evidence is consistent with V. bachmanii having been a highly divergent, reproductively isolated species, with no evidence of introgression. We show that these three species have similar levels of runs of homozygosity (ROH), consistent with effects of a small long-term effective population size or population bottlenecks, with one V. bachmanii outlier showing numerous long ROH and a FROH greater than 5%. We also found—using population branch statistic estimates—previously undocumented evidence of lineage-specific evolution in V. chrysoptera near a pigmentation gene candidate, CORIN, which is a known modifier of ASIP, which is in turn involved in melanic throat and mask coloration in this family of birds. Together, these genomic results also highlight how natural history collections are such invaluable repositories of information about extant and extinct species. 
    more » « less
  3. Buerkle, Alex (Ed.)
    The inference of positive selection in genomes is a problem of great interest in evolutionary genomics. By identifying putative regions of the genome that contain adaptive mutations, we are able to learn about the biology of organisms and their evolutionary history. Here we introduce a composite likelihood method that identifies recently completed or ongoing positive selection by searching for extreme distortions in the spatial distribution of the haplotype frequency spectrum along the genome relative to the genome-wide expectation taken as neutrality. Furthermore, the method simultaneously infers two parameters of the sweep: the number of sweeping haplotypes and the “width” of the sweep, which is related to the strength and timing of selection. We demonstrate that this method outperforms the leading haplotype-based selection statistics, though strong signals in low-recombination regions merit extra scrutiny. As a positive control, we apply it to two well-studied human populations from the 1000 Genomes Project and examine haplotype frequency spectrum patterns at the LCT and MHC loci. We also apply it to a data set of brown rats sampled in NYC and identify genes related to olfactory perception. To facilitate use of this method, we have implemented it in user-friendly open source software. 
    more » « less
  4. Abstract In studying allele-frequency variation across populations, it is often convenient to classify an allelic type as “rare,” with nonzero frequency less than or equal to a specified threshold, “common,” with a frequency above the threshold, or entirely unobserved in a population. When sample sizes differ across populations, however, especially if the threshold separating “rare” and “common” corresponds to a small number of observed copies of an allelic type, discreteness effects can lead a sample from one population to possess substantially more rare allelic types than a sample from another population, even if the two populations have extremely similar underlying allele-frequency distributions across loci. We introduce a rarefaction-based sample-size correction for use in comparing rare and common variation across multiple populations whose sample sizes potentially differ. We use our approach to examine rare and common variation in worldwide human populations, finding that the sample-size correction introduces subtle differences relative to analyses that use the full available sample sizes. We introduce several ways in which the rarefaction approach can be applied: we explore the dependence of allele classifications on subsample sizes, we permit more than two classes of allelic types of nonzero frequency, and we analyze rare and common variation in sliding windows along the genome. The results can assist in clarifying similarities and differences in allele-frequency patterns across populations. 
    more » « less
  5. Apes possess two sex chromosomes—the male-specific Y chromosome and the X chromosome, which is present in both males and females. The Y chromosome is crucial for male reproduction, with deletions being linked to infertility1. The X chromosome is vital for reproduction and cognition2. Variation in mating patterns and brain function among apes suggests corresponding differences in their sex chromosomes. However, owing to their repetitive nature and incomplete reference assemblies, ape sex chromosomes have been challenging to study. Here, using the methodology developed for the telomere-to-telomere (T2T) human genome, we produced gapless assemblies of the X and Y chromosomes for five great apes (bonobo (Pan paniscus), chimpanzee (Pan troglodytes), western lowland gorilla (Gorilla gorilla gorilla), Bornean orangutan (Pongo pygmaeus) and Sumatran orangutan (Pongo abelii)) and a lesser ape (the siamang gibbon (Symphalangus syndactylus)), and untangled the intricacies of their evolution. Compared with the X chromosomes, the ape Y chromosomes vary greatly in size and have low alignability and high levels of structural rearrangements—owing to the accumulation of lineage-specific ampliconic regions, palindromes, transposable elements and satellites. Many Y chromosome genes expand in multi-copy families and some evolve under purifying selection. Thus, the Y chromosome exhibits dynamic evolution, whereas the X chromosome is more stable. Mapping short-read sequencing data to these assemblies revealed diversity and selection patterns on sex chromosomes of more than 100 individual great apes. These reference assemblies are expected to inform human evolution and conservation genetics of non-human apes, all of which are endangered species. 
    more » « less
  6. ABSTRACT We present haplotype-resolved reference genomes and comparative analyses of six ape species, namely: chimpanzee, bonobo, gorilla, Bornean orangutan, Sumatran orangutan, and siamang. We achieve chromosome-level contiguity with unparalleled sequence accuracy (<1 error in 500,000 base pairs), completely sequencing 215 gapless chromosomes telomere-to-telomere. We resolve challenging regions, such as the major histocompatibility complex and immunoglobulin loci, providing more in-depth evolutionary insights. Comparative analyses, including human, allow us to investigate the evolution and diversity of regions previously uncharacterized or incompletely studied without bias from mapping to the human reference. This includes newly minted gene families within lineage-specific segmental duplications, centromeric DNA, acrocentric chromosomes, and subterminal heterochromatin. This resource should serve as a definitive baseline for all future evolutionary studies of humans and our closest living ape relatives. 
    more » « less